Mass signature of supernova and v T neutrinos in SuperKamiokande 

J. F. Beacom* and P. Vogelt 
Department of Physics, California Institute of Technology 
Pasadena, CA 91125, USA 
(February 25, 1998) 



OO 

On 

3 



The v^i and v T neutrinos (and their antiparticles) from a Galactic core-collapse supernova can be 
observed in a water-Cerenkov detector by the neutral-current excitation of 16 O. The number of 
events expected is several times greater than from neutral-current scattering on electrons. The 
observation of this signal would be a strong test that these neutrinos are produced in core-collapse 
supernovae, and with the right characteristics. In this paper, this signal is used as the basis for 
a technique of neutrino mass determination from a future Galactic supernova. The masses of the 
Vy, and v T neutrinos can either be measured or limited by their delay relative to the v^. neutrinos. 
By comparing to the high-statistics data instead of the theoretical expectation, much of the 
model dependence is canceled. Numerical results are presented for a future supernova at 10 kpc as 
seen in the SuperKamiokande detector. Under reasonable assumptions, and in the presence of the 
expected counting statistics, and v T masses down to about 50 eV can be simply and robustly 
determined. The signal used here is more sensitive to small neutrino masses than the signal based 
on neutrino-electron scattering. 
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I. INTRODUCTION 

When the core of a large star (M > 8M Q ) runs out 
of nuclear fuel, it collapses and forms a proto-neutron 
star with a central density well above the normal nu- 
clear density (for a review of type-II supernova the- 
ory, see Rcf. M). The total energy released in the col- 
lapse, i.e., the gravitational binding energy of the core 
(E B ~ GnMq/R with R ~ 10 km), is about 3 x 10 53 
ergs; about 99% of that is carried away by neutrinos and 
antincutrinos, the particles with the longest mean free 
path. The proto-neutron star is dense enough that neu- 
trinos diffuse outward over a timescale of several seconds, 
maintaining thermal equilibrium with the matter. When 
they are within about one mean free path of the edge, 
they escape freely, with a thermal spectrum characteris- 
tic of the surface of last scattering. The luminosities of 
the different neutrino flavors are approximately equal. 

Those flavors which interact the most with the matter 
will decouple at the largest radius and thus the lowest 
temperature. The and v T neutrinos and their an- 
tiparticles, which we collectively call v x neutrinos, have 
only neutral-current interactions with the matter, and 
therefore leave with the highest temperature, about 8 
MeV (or (E) ~ 25 MeV). The D e and v e neutrinos have 
also charged-current interactions, and so leave with lower 
temperatures, about 5 MeV ((E) ~ 16 MeV) and 3.5 
MeV ((E) ~ 11 MeV), respectively. The v e temperature 
is lower because the material is neutron-rich and thus the 
v e interact more than the v e . The observation of super- 
nova v x neutrinos would allow the details of the picture 
above to be tested. For a detailed description of the su- 
pernova neutrino emission, including the justification of 
our choice of temperatures, see Refs. (HI). 



Even after many decades of experiments, it is still not 
known whether neutrinos have mass. Results from sev- 
eral experiments strongly suggest that neutrino flavor 
mixing occurs in solar, atmospheric, and accelerator neu- 
trinos, and proof of mixing would be a proof of mass. The 
requirement that neutrinos do not overdose the universe 
gives a bound for the sum of masses of stable neutrinos 
(see [0 and references therein): 
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< 100 eV . 



(1) 



However, direct kinematic tests of neutrino mass cur- 
rently give limits for the masses compatible with the 
above cosmological bound only for the electron neu- 
trino, TO Pe < 5 eV ||. For the and v T neutrinos 
the kinematic limits far exceed the cosmological bound: 
m Ufi < 170 keV [§, and m„ T < 24 MeV §]. It is very 
unlikely that direct kinematic tests can improve these 
mass limits by the necessary orders of magnitude any 
time soon. 

As we will show in detail below, the most promis- 
ing method for determining these masses is with super- 
nova neutrinos. Even a tiny mass will make the velocity 
slightly less than for a massless neutrino, and over the 
large distance to a supernova will cause a measurable de- 
lay in the arrival time. A neutrino with a mass m (in 
eV) and energy E (in MeV) will experience an energy- 
dependent delay (in s) relative to a massless neutrino in 
traveling over a distance D (in 10 kpc) of 



At(E) 



o, 15 (D 2 D 



(2) 



where only the lowest order in the small mass has been 
kept. Since one expects one type-II supernova about 
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every 30 years in our Galaxy 0, and since supernova 
neutrino detectors are currently operating, it is worth- 
while to consider whether mass limits (or values) for v x 
compatible with the cosmological bound, Eq. (|l|) , can be 
obtained. 

The problem of v x mass determination with supernova 
neutrinos in existing (e.g., Refs. |p|-p^|) and proposed de- 
tectors (e.g., Refs. [|3|-^5[) has been considered before. 
The present work differs from the previous ones by the 
method with which the v x are detected: inelastic scatter- 
ing on 16 nuclei followed by proton or neutron emission, 
and subsequent gamma decay of excited 15 N or 15 nu- 
clei, as suggested in Ref. [ [L6| . We describe this signal 
and its time structure in Section II. In Section III we dis- 
cuss the most relevant case of small masses. We find the 
smallest v x mass that is recognizably different from zero 
in the presence of the expected finite counting statistics. 
In Section IV we show that the mass range is also lim- 
ited from above. If the v x mass is too large, the signal is 
broadened to such a degree that it disappears into the un- 
avoidable background. We find the largest detectable v x 
mass. Finally, in Section V we summarize our findings. 



II. CHARACTERISTICS OF THE MODEL 

A. Neutrino scattering rate 

We assume that the double differential number 
distribution of neutrinos of a given flavor (one of 
v ei v e , i/p, Pp, v T ,v T ) at the source can be written in the 
product form: 



d 2 N v 
dEdt, 



F(E)G(U), 



(3) 



where E is the neutrino energy and ti is the time at 
the source. The double integral of this quantity is the 
total number of emitted neutrinos of that flavor N v . 
This form assumes that the energy spectrum F{E) is 
time-independent; the time dependence of the source is 
parametrized solely by G(ti). The reasons for assum- 
ing that the energy and time dependences are separable 
will be given below. The most general form would allow 
F = F(E,ti), e.g., a time-dependent temperature. The 
luminosity is 

L(U) = J dEE^- = (E) G(t t ) J dEF(E) , (4) 

where (E) is the (time-independent) average energy. If 
the energy spectrum is normalized as 



f(E) = 



F(E) 



JdEF(E) 



then we can write 

d 2 N. 



dEdt 



T = f(E) 



L{U) 
(E) 



(5) 



(6) 



This form is convenient since we assume, as stated earlier, 
that the luminosities of the different flavors are approxi- 
mately equal at every time ij. The energy spectrum f(E) 
will be taken to be thermal, and the luminosity L(ti) will 
be taken to have a very sharp rise and an exponential de- 
cline. The arrival time of a neutrino of mass m at the 
detector is t = U + D + At(E) 1 where D is the distance 
to the source, and the energy-dependent time delay is 
given by Eq. (g). For convenience, we drop the constant 
D. Then the double differential number distribution of 
neutrinos at the detector is given by 



d 2 N v 
dEdt 



= / dt 



d 2 N v 



S(t -U- At{E)) 



f(E) 



dEdt_ 
L(t-At(E)) 

W) 



(7) 



Note that because of the mass effects, this is no longer the 
product of a function of energy alone and a function of 
time alone. The number flux of neutrinos at the detector 
is obtained by dividing this by AirD 2 . The scattering rate 
for a given neutrino reaction is then 



dN st 
dt 



N H20 n / dEa(E) 



1 d 2 N v 
4ttD 2 dEdt 



(8) 



where Nh 2 o is the number of water molecules in the de- 
tector, a(E) the cross section for a neutrino of energy E 
on the target particle, and n the number of targets per 
water molecule for the given reaction. Using the results 
above, 



dN sl 
dt 



= N H , 



1 



1 



AttD 2 (E) 



dEa(E)f(E)L(t - At(E)) . 

(9) 



In more convenient units, the scattering rate (per s) is: 



dN sl 
dt 



where 



C = 9.21 



C J dEf(E) 



a(E) 



L(t-At(E)) 



E B /6 



(10) 



Eb 



10 



53 



ergs 



1 MeV 



T 



10 kpc 



D 



det. mass 



1 kton 



(11) 



T is the spectrum temperature (where we assume (E) = 
3.15T, as appropriate for a Fermi-Dirac spectrum), and 
f(E) is in MeV -1 . Since the luminosities are equal for 
each flavor, the total binding energy released in a given 
flavor is Eb /6 (we ignore the small effect associated with 
the neutronization burst). When an integral over all ar- 
rival times is made, the luminosity term in square brack- 
ets integrates to one, giving for the total number of scat- 
tering events: 
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AT.,, 



C / dEf(E) 



a(E) 



(12) 



The formulae in this section were derived for a nonzero 
neutrino mass; for massless neutrinos, simply take 
At(E) — throughout. In particular, note that in 
Eq. (|1C|) the luminosity term then can be taken outside 
of the integral, making the time dependence of the scat- 
tering rate simply a constant times the time dependence 
of the luminosity. 



B. Details of the model 

As noted above, we assume that the energy distribu- 
tion for a given flavor of neutrinos is time-independent, 
e.g., that the temperature does not vary with time. While 
the temperature really will vary with time, the variation 
is probably not large (see e.g., Fig. 3 of Ref. 0, but note 
that those "average" energies are defined &s\E 2 ) / (E)) . 
Also, recent numerical models of supernovae disagree on 
the form of the variation, and even whether it is rising or 
falling. A well-motivated form for temperature variation 
may eventually be obtained from the supernova v e data 
or from more-developed numerical models. The analysis 
of this paper could be easily modified to allow a varying 
temperature; until there is a compelling reason to use a 
particular form, we simply use a constant temperature. 

The energy distribution is taken to be a Fermi-Dirac 
distribution, characterized only by a temperature. We 
take T = 8 MeV for v x , T = 5 MeV for v e , and T = 3.5 
MeV for v e . These temperatures are consistent with nu- 
merical models, e.g., in Ref. ||. More elaborate mod- 
els also introduce a chemical potential parameter to re- 
duce the high-energy tail of the Fermi-Dirac distribution. 
That reduces the number of scattering events, but makes 
the dominant contribution to the cross section occur at 
a lower neutrino energy, thus giving a larger delay. 

Numerical supernova models suggest that the neutrino 
luminosity rises quickly over a time of order 0.1 s, and 
then falls over a time of order several seconds. There- 
fore, the luminosity used in our numerical simulation is 
composed of two pieces. The first gives a very short rise 
from zero to the full height over a time 0.09 s, using one 
side of a Gaussian with a — 0.03 s. The rise is so fast 
that the details of its shape are irrelevant. The second 
piece is an exponential decay with time constant r = 3 s. 
The luminosity then has a width of 10 s or so, consistent 
with the SN 1987A observations. The detailed form of 
the neutrino luminosity is less important than the gen- 
eral shape features and their characteristic durations. In 
Ref. the neutrino luminosity actually decreases as a 
power law, and does so somewhat faster than our expo- 
nential. The slower the decay, the harder it is to see mass 
effects, so our choice is actually somewhat conservative. 

Throughout the paper, we assume that the distance to 
the supernova is D = 10 kpc, approximately the distance 
to the Galactic center. 



C. Characteristics of SuperKamiokande 

In this paper, all of the results are for the Su- 
perKamiokande (SK) detector. The analysis here could 
be easily applied to any water- Cerenkov detector. Its 
large size, low threshold, and low background rate make 
it very well-suited to detect a Galactic supernova. We as- 
sume an energy threshold of 5 MeV; presently it is a little 
bit higher but has been lowered a few times. The full vol- 
ume of the main tank is 32 kton. From SK conference 
talks [jl7| , we estimate the time- independent background 
rate for the inner fiducial 22.5 kton volume to be about 
0.1 s^ 1 for a threshold of 5 MeV. For the full 32 kton 
volume, we estimate that the background rate can be no 
more than several times worse than 0.1 s _1 , again for a 
threshold of 5 MeV. For the low-mass search in Section 
III, we assume that the full 32 kton volume is used. The 
exact value of the time-independent background rate is 
completely irrelevant in that search. For the high-mass 
search in Section IV, we assume that only the inner 22.5 
kton will be used, since in that case the time-independent 
background rate would be an important factor. Using 
only the inner volume will decrease the number of signal 
events by a factor 1.4, while decreasing the background 
by a factor of at least a few. 



D. Description of the signal 

The cross section for the neutral-current excitation of 
16 O by neutrinos was computed numerically in Ref. [fl6|| . 
It was assumed to be a two-step process, of excitation 
of 16 O to the continuum, followed by decays into var- 
ious final states. The principal branches in this decay 
are to states of 15 N + p and 15 + n. For v x neutrinos 
with a thermal spectrum with T = 8 MeV, the combined 
branching ratio for these final states is about 95%. If the 
decay is to a bound excited state of the daughter nucleus, 
then the daughter will decay by gamma emission. At the 
relevant excitation energies in 16 O, the branching ratio 
to these states in the daughters is about 30%. The cru- 
cial point is that in both 15 N and 15 all gamma rays lie 
between 5 and 10 MeV and can thus be detected in SK. 
The other 70% of the branching ratio involves decays to 
the ground state of the daughters without gamma emis- 
sion. In order to get to a final state with a gamma, the 
neutrino energy must be greater than about 20 MeV. Be- 
cause of this high threshold, and because of the lower v e 
and D e temperatures, these reactions contribute only at 
the 2% level compared to the v x reactions, and hence are 
ignored fill. 

In Refe. | ]l6| , |l8t , the neutral-current cross sections were 
calculated numerically and folded with thermal neutrino 
spectra of different temperatures. For the present pur- 
pose we need the cross section for a given neutrino en- 
ergy. It turns out that the simple parameterized form 
cr(E) = (7q(E— 15) 4 , with the neutrino energy E in MeV 
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and (Jo = 0.75 x 10 -47 cm 2 describes quite well the cross 
section for a neutrino to excite 16 O. In the fit we as- 
sumed that the branching ratio for states that end with 
gamma emission is independent of neutrino energy. All 
such branches are included in this cross section above, 
and we have summed the cross sections for neutrinos and 
antineutrinos (for just one flavor), as well as both final 
channels. The fit values agree with the numerical cal- 
culations at the 10% level over four orders of magnitude 
in the thermally-averaged excitation cross section. This 
fit will certainly not hold at higher energies which are 
however irrelevant in the present context. 

In order to estimate the delay, Eq. (|2|) can be evalu- 
ated with a typical neutrino energy. However, one should 
not use the average energy, (E) = 25 MeV. Rather, one 
should use the energy for which f(E)a(E) peaks. For this 
reaction, this "Gamow peak" energy is E ss 60 MeV, i.e., 
considerably larger than (E) . The fact that the neutrinos 
have a spectrum of energies means that different values of 
E contribute to the time delay, causing dispersion of the 
neutrino pulse as it travels from the supernova. It turns 
out that for the small masses we are primarily interested 
in these dispersive effects are minimal. 

The signal associated with the gamma emission de- 
scribed above will not be the dominant signal of a Galac- 
tic supernova in SuperKamiokande. Rather, the domi- 
nant events will be the positrons from v e + p — > e + + n, 
which give a smooth continuum in positron energy, peak- 
ing at about 20 MeV. The expected numbers of events 
for various reactions were calculated with Eq. ( |l2| ) and 
are given in Table I. For the v e absorption on proton re- 
action, recoil and weak magnetism effects were taken into 
account, which slightly reduces the cross section. There 
are also charged-current reactions on 16 O these in- 
crease the dominant positron signal by about 1%. Since 
events from the electron-scattering channels are forward- 
peaked, we assume that they are removed by an angular 
cut. Therefore, in our analysis we use only the events 
from P e absorption on protons and the v x excitation of 
16 0. 

The gammas from the neutral-current reactions above 
are at several discrete energies ranging from 5.2 MeV to 
9.9 MeV. These are subject to some smearing, due to the 
finite resolution, giving few narrow peaks on top of the 
smooth distribution of positrons as shown in Fig. 2 of 
Ref. fll6[ . For simplicity, we treat the energy range from 
threshold to 10 MeV as one bin, and assume that losses 
due to the threshold or efficiency are minimal. 

In Ref. |]l6|| numbers of events from different reactions 
were calculated relative to each other, with the overall 
scale set by the total number of u e events from Ref. [^0) . 
However, the number of v e events corresponding to T = 3 
MeV from Ref. |2rj] was used. This was not really con- 
sistent, and would not be consistent here either, since for 
the v e neutrinos, T = 5 MeV is assumed here and in 
Ref. |Q. Consequently, we use instead Eq. (|lj) to cal- 
culate the number of events for T = 5 MeV. We verified 
that the rates based on Eq. (|l2|) agree with the numbers 



TABLE I. Calculated numbers of events expected in SK 
with a 5 MeV threshold and a supernova at 10 kpc. The other 
parameters (e.g., neutrino spectrum temperatures) are given 
in the text. In rows with two reactions listed, the number of 
events is the total for both. The second row is a subset of the 
first row that is an irreducible background to the reactions in 
the third and fourth rows. 



reaction 


number of events 


+ p — > e + + n 


8300 


v e +P^ e+ + n {E e+ < 10 MeV) 


530 


vy, + 1D O -> v„ + 7 + X 
£V+ 16 0->^ M + 7 + X 


355 


Vr+ 10 O^V T +J + X 

v T + 10 O -> v T + 7 + X 


355 


v e + e~ — > v e + e~ 
i> e + e~ — > v e + e~ 


200 


+ e~ — > + e~ 
Pfi + e~ — > + e~ 


60 


v T + e~ — > v r + e~~ 
v T + e~ — > v T + e~ 


60 



given in Ref. [|2C)[ when consistent temperatures are used. 
Note that the results of Ref. jl6| are changed only by in- 
creasing the number of events in each reaction by a factor 
of about 2. 



III. LOW-MASS CASE 

In this section, we detail the strategy used in the anal- 
ysis. First, the v e mass is low enough that it can be 
neglected. Our final result is that one can reach sensitiv- 
ity down to a v x mass of about 50 eV. Since the v e mass 
is at least 10 times smaller, and since the delay depends 
quadratically on the mass, this neglect is justified. This 
establishes the key point of our technique: that we can 
use the D e events as a clock by which to measure the pos- 
sible delay of the v x neutrinos. Under our assumption 
that the temperatures are approximately constant, the 
only time dependence of the v e scattering rate is from 
the v e luminosity itself (see Eq. ( |To| ) with m = 0). In 
contrast, the time dependence of the v x scattering rate is 
determined both by the v x luminosity and the delaying 
effects of a possible mass. Thus the effects of a mass can 
be tested for by comparing the scattering rates of the v e 
and v x events as a function of time. In other words, we 
are looking for time dependence in the v x rate beyond 
that expected from the luminosity variation alone. In 
order to implement this, we define two rates, as follows. 

The scattering rate of v e events with E e + > 10 
MeV will be called the Reference R(t). This contains 
w 8300 — 530 w 7800 events. The time dependence of 
R{t) is completely determined by the time dependence 
of the luminosity. Its shape is generic for massless neu- 
trinos. The Signal S(t) has three components. The first 
is the scattering rate for the 355 events from the com- 



4 



bined and on 16 O reactions. The second is the 
same for the 355 combined v T and v T events. The third 
is the scattering rate for the 530 v e events with E e + < 10 
McV. We will assume that some portion of the Signal 
S(t) events are massive (either all v T events or all f M and 
v T events). All of the other events in S(t) are then mass- 
less background events. Because some of the S(t) events 
will be massive, the shape of S(t) will be distorted. In 
particular, it will be delayed and broadened. 

In a given experiment (i.e., one supernova), the Sig- 
nal S(t) and the Reference R(t) will be measured. In 
order to facilitate comparison of their shapes, the curve 
R(t) can be scaled down to the number of events in S(t). 
The curve S(t) shows how the data look, with a possible 
unknown v T mass, and the curve R(t) shows how they 
would look if all of the events were massless. The rates 
are shown in Fig. 1, which depicts S(t) under different 
assumptions about the v T mass. The shape of R(t) is the 
same as that of S(t) when m„ T = 0. The curve R(t) will 
be measured, and so will be there to compare the mea- 
sured S(t) to. As the v T mass is increased, the delayed 
v T events separate from the massless events more and 
more. For m = 125 eV, the scattering rate over the first 
1 s or so is just that from the remaining massless events. 
The effect of a mass is to diminish the rate at early times 
and enhance it at late times (since the normalization is 
preserved, these are roughly equivalent statements). 

In a real experiment, statistical fluctuations will mask 
the effect of a mass. The Reference R(t) contains ap- 
proximately 7800 events, and thus has small relative 
fluctuations. The Signal S(t) contains approximately 
1240 events, and therefore has larger relative fluctua- 
tions. Each of those curves is subject to fluctuations 
in the total number of events as well as fluctuations in 
any time interval. Consider for a moment the events in 
the first 1 s of Fig. 1. There are 336 events expected in 
the m„ T = eV case, and 302 events expected in the 
m„ T = 50 eV case. As noted, the Reference has smaller 
fluctuations, so for now take the total 336 as exact. The 
countin g er ror on the Signal in that interval will be of 
order \/302 w 17. If the number of events in this bin 
fluctuates up by about two sigma, then the number of 
events in the Signal over this interval would match the 
number expected for the massless case, and we would 
have to conclude that most probably the mass of the v T 
is zero. In the analysis below, we use much more of the 
data, but the idea is the same: it is possible for one mass 
case to fake another through fluctuations. The degree to 
which this can occur depends primarily on the number of 
events expected in the Signal. We will restrict the range 
of fluctuations that we consider to be likely by choosing 
confidence levels. 

To treat the expected fluctuations properly, we use a 
Monte Carlo technique to generate representative statis- 
tical instances of the theoretical forms for R(t) and S(t). 
Each run represents one supernova as seen in SK. The to- 
tal number of events expected in R(t) is known. In each 
particular run, this total is subject to Poisson fluctua- 




4 5 
t(s) 

FIG. 1. The expected event rate in the absence of fluctua- 
tions for the signal S(t) is shown for different v T masses, as 
follows: solid line, eV; dotted line, 50 eV; dashed line, 75 
eV; dot-dashed line, 125 eV. Of 1240 total events, 530 are 
massless v^, 355 are massless z/ M and Dp , and 355 are mas- 
sive v T and v T . These totals count events at all times; in the 
figure, only those with t < 9 s are shown. 



tions. We model this by picking a Poisson random num- 
ber from a distribution with mean given by the expected 
number of events. This gives the number of events for 
this particular run. We then use an acceptance-rejection 
method to sample the form R(t) until the right number 
of events for that run is obtained. This gives a statistical 
instance of R(t), typical of what might be seen in a sin- 
gle experiment. Then an exactly analogous technique is 
used to generate the total number of events in S(t) and a 
statistical instance of the curve S(t) itself. The massless 
and massive components of S(t) are sampled separately, 
and are then added together. 

One comment on the method of sampling is necessary. 
No matter how the generated rates are binned in time, 
this method ensures that in each bin there are the cor- 
rect Poisson fluctuations around the expected number in 
that bin. Therefore, this technique is equivalent to the 
sometimes-seen technique of first establishing bins and 
the expected number in each bin, and then picking a rep- 
resentative number of events for that bin according to the 
appropriate Poisson distribution. However, our method 
of generating representative Signal and Reference data 
sets does not require binning. If analysis of these data 
sets uses bins, the bin size can be changed without re- 
generating the data. 
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Both of the tests developed below depend upon the 
shape of S(t), and not directly on the number of counts. 
Direct tests for an excess or deficit of counts are much 
more dependent on theory; this dependence is largely 
canceled in our approaches. 



A. x analysis 

As discussed above, the presence of a mass in the Sig- 
nal S(t) will cause its relative decrease at early times 
and relative increase at late times in comparison with 
the Reference R(t). Whether or not it can be seen is a 
question of the statistics of the event rates. As a first test 
we look for a shape distortion in S(t) relative to R(t) by 
making a x 2 test. If the x 2 P er degree of freedom (d.o.f.) 
is of order unity, then the two curves are compatible at 
the level of the errors, and there is no reason to invoke 
a mass. If the x 2 /d.o.f. is large, then the two functions 
are incompatible, which we take as evidence for a mass. 
That is, we assume that there are no other systematic 
effects which would give a large x 2 /d.o.f.; one always has 
to make some such assumption. 

We first scale the Reference down to the number of 
events observed in the Signal over the range < t < t max . 
As required for a x 2 test, both sets are then binned so 
the continuous functions R(t) and S(t) are replaced by 
discrete representations. Bins of constant width St are 
used. The scaling is given by 



Rj - R 



^N b 



RjSt 



(13) 



where t, 



NtinSt. When m = 0, Rj = Sj, up to 



statistical fluctuations. The x is formed as follows: 



X 2 /d.o.f. 



Nbin — 



— Y 



{Rjdt - SjStf 



j=i sRjSt + Sj5t 



(14) 



where j runs over the bins used. The number of degrees 
of freedom is reduced by one because we have normalized 
the Reference to the Signal. The factor s is the ratio of 
the total numbers of events (in < t < t max ) in the Sig- 
nal and the Reference, and is computed for each run in 
the Monte Carlo. Even though RjSt as SjSt, the fluctua- 
tions in RjSt are much smaller since RjSt is scaled down 
from RjSt, which has high statistics. 

It is important to stress that it is not enough to eval- 
uate the x 2 using the predicted curves for R(t) and S(t ) 
based on the analytic forms constructed with Eq. (10). 
Doing so neglects fluctuations, and always underesti- 
mates the x 2 1 particularly near the small-mass limit 
that we are interested in (since in the massless case 
R[t) — S(t)). Roughly speaking, using the exact func- 
tions themselves in the x 2 underestimates the x 2 /d-0.f. 
by about unity, and of course does not give the error. 



As explained above, we use the Monte Carlo technique 
which properly treats statistical fluctuations, and leads 
to a more conservative (and correct) mass limit. 

Only a finite range of times was used in forming the 
X 2 ■ The beginning of the first bin is taken to be where 
the events start. With some 9000 total events expected, 
and a risetime of order 0.1 s, the starting time can be 
reasonably well-defined. In the Monte Carlo, the starting 
time was held fixed (and not adjusted from the data on 
each run). The definition used amounts to calling the 
starting time that point at which the v e rate is about 1% 
of its peak rate. The size of any ambiguity in the starting 
time is much smaller than the bin size (discussed below) , 
and so is regarded as irrelevant. 

The ending time and the bin size must be chosen more 
carefully. The primary consideration is to maximize the 
extraction of the mass effect in the presence of the statis- 
tical fluctuations. Further, this must be optimized for the 
case of a small mass (other cases are discussed below). In 
Fig. 1, one can see that for a given m VTl the Signal S(t) 
rejoins the Reference R(t) at very late times (even be- 
yond the edge of the figure for the larger masses). Once 
this has happened, there is no benefit to going to larger 
times; in fact, one only includes more statistical noise by 
doing so. In the Monte Carlo studies, it was found that 
tmax = 9 s and a bin size of 5t = 1 s were good choices 
(the final results are only weakly dependent on these). 
These are also very reasonable from a physical point of 
view. These have to be held fixed for all of the Monte 
Carlo runs, since one cannot adjust these to a particular 
data set without introducing bias. These choices also en- 
sure that we can completely neglect the time-independent 
background rate of at most a few times 0.1 s _1 . 

With these choices, one has a reasonable number 
(namely 8) of degrees of freedom in the x 2 , and a large 
number of events expected in each bin. The latter ensures 
that the Poisson errors on the counts in each bin really 
are approximately Gaussian, as required in the x 2 defi- 
nition. Up to fluctuations, the late-time bins all have an 
excess. Combining them would enhance the significance 
of this excess, whereas for random fluctuations combin- 
ing bins does not change the significance. The same is 
true for the early-time deficits. However, one does not 
in general know where the transition point is between 
these two regions; that is determined by the unknown 
mass. The transition point cannot be determined from 
the data without introducing bias. Also, with too few 
bins, one does not satisfy the requirements for defining a 
X 2 test. 

Using the above procedure for analyzing each run (and 
in particular, normalizing R(t) to S(t) over < t < 
tmax), we used the Monte Carlo to simulate the results 
from 10 4 supernovae. For each run, the x 2 analysis was 
performed. For each fixed mass, a variety of x 2 values are 
obtained, due to the finite statistics in the Reference and 
the Signal. These results were histogrammed as x 2 /d.o.f. 
The relative frequencies of different x 2 /d.o.f. values are 
shown in the upper panel of Fig. 2 for a few represen- 
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FIG. 2. The results of the \ 2 analysis for a massive v T . In 
the upper panel, the relative frequencies of various x 2 /d-0.f. 
values are shown for a few example masses. In the lower 
panel, the range of masses corresponding to a given xVd.o.f. 
is shown. The solid line is the 50% confidence level, and the 
upper and lower dashed lines are the 10% and 90% confidence 
levels, respectively. In this figure, t max = 9 s, the bin size used 
in the \ 2 is <5t = 1 s, and the time constant of the exponential 
luminosity is r = 3 s. 



tative masses. (Note that the number of Monte Carlo 
runs determines only how smoothly these distributions 
are filled out; their shape and placement is determined 
by the physics.) For m = 0, the resulting x 2 /d.o.f. values 
of course fill out the usual x 2 /d.o.f. distribution with 8 
degrees of freedom. 

These distributions are characterized by their central 
point and their (asymmetric) width, using the 10%, 50%, 
and 90% confidence levels. That is, for each mass we de- 
termined the value of x 2 /d.o.f. such that a given percent- 
age of the Monte Carlo runs yielded a value of x 2 /d.o.f. 
less than that value. With those three numbers, we 
can characterize the results of complete runs with many 
masses much more compactly, as shown in the lower 
panel. For convenience, the axes are inverted from how 
the plot was actually constructed. That is, given the 
X 2 /d.o.f. which will be experimentally determined, one 
can read off the range of masses that could have likely 
given such a x 2 /d.o.f. at these confidence levels. 



B. (t) analysis 

The x 2 test above has the nice feature that it is a shape 
test, and depends on the number of events only through 
the fluctuations. One disadvantage is its dependence on 
binning, which obscures changes over timescales smaller 
than of order the bin width, i.e., the effects of sufficiently 
small masses. Another is that the mass effect is not al- 
ways in the same sense. At early times there is a deficit 
of events, whereas at late times there is an excess; the 
X 2 is insensitive to the difference between this distinctive 
feature and random fluctuations of similar magnitude. 
To get around these problems, we introduce here tests 
of integral moments. These do not involve any binning. 
The most basic effect of a mass is a delay; the average ar- 
rival time always increases. The test is simple, intuitively 
obvious, and the effect is always in the same sense (up 
to statistical fluctuations). A mathematically analogous 
moments analysis was made for electron recoil energies 
in the context of solar neutrino oscillations in Ref. plf . 

Given the Reference R(t), the average arrival time is 
defined as 



Ekh _ Jp ma *dttR(t) 

Ek 1 J* ma *dtR(t) 



(15) 



The summation form is used for the Monte Carlo gener- 
ated data sets, where the sum is over events (not time 
bins) in the Reference with < t < t max . The integral 
form would be used if the theoretical forms for the rates 
were given. It is no longer necessary to normalize the 
Reference to the Signal. As with the % 2 test, the start- 
ing time is assumed to be well-defined. The choice of 
tmax follows from similar considerations as before. The 
effect of the finite number of counts in R(t) is to give (t)n 
a statistical error. This error is the intrinsic width of the 
R(t) distribution divided by the square root of the num- 
ber of events in the Reference. Both the intrinsic width 
and number of events depend on the choice of t max . By 
choosing a moderate t max , the intrinsic width of R(t) can 
be restricted even while most events are included. 

Given the Signal S(t), the average arrival time is de- 
fined similarly as 



T, k t k _ Jp— dttsjt) 
E*i J- W dts(t) ' 



(16) 



where naturally the sums are now over events in the Sig- 
nal. While the intrinsic widths of R(t) and S(t) are sim- 
ilar, the statistical error on (t)s is larger by factor of a 
few since there are several times fewer events. The effect 
of the mass is to make (t)s larger, i.e., to cause a de- 
lay. (The mass increases the intrinsic width of S(t) only 
slightly.) 

In order to cancel some systematic effects, we consider 
not (t)s as compared to theory but the difference (t)s — 
(t) r determined from the data. The signal of a mass is 
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that this is greater than zero with statistical significance. 
From the Monte Carlo studies, t max = 9 s was found to 
be a very reasonable choice; most of the data are then 
included while the range is kept small. For this t maxi 
the time-independent background events are negligible. 
Again, while these choices are somewhat optimal, the 
final results are not strongly dependent on the particular 
values used as long as they are reasonable. Although 
the values of (t) depend on t maxi the dependence is not 
strong. For t max = 9 s, a change of 0.1 s in t max gives a 
change of about 0.01 s in (t). Note that any shift in the 
starting time will cancel in the difference (t)s — (t) R (as 
long as it doesn't change the numbers of events included). 

Using the above procedure for analyzing a particular 
run, we again used the Monte Carlo to simulate the re- 
sults from 10 4 supernovae. Basically, things were done 
as above for x 2 . For each run, (t)s — (t) R was calcu- 
lated and its value histogrammed. These distributions 
are again characterized by their central point and their 
width, using the 10%, 50% (now also the average), and 
90% confidence levels. That is, for each mass we deter- 
mined the values of (t) s — (t) R such that a given percent- 
age of the Monte Carlo runs yielded a value of (t) 5 — (t) R 
less than that value. Since these distributions arc Gaus- 
sians, other confidence levels can easily be constructed. 
The results of this analysis are shown in Fig. 3, which is 
analogous to Fig. 2. 

For t max = 9 s, (t) R = 2.57 s. For larger t max , (t) R 
tends to about 3 s, the value of the exponential time 
constant in the luminosity. The value of (t) s is of course 
larger by the mass effect. As noted, the error on each 
moment is the intrinsic width divided by the square root 
of the number of events. The intrinsic widths of the R(t) 
and S(t) distributions are each of order a few seconds. 
The numbers of events are of order 8000 and 1200, re- 
spectively. Note that the errors on (t) R and (t)s are 
uncorrelated. 

We also investigated the dispersion of the event rate in 
time as a measure of the mass. As noted above, a mass 
alone causes a delay, but a mass and an energy spec- 
tr um also ca use dispersion. We defined the dispersion as 
\J (t 2 ) — (t) 2 , where all integrals are as above defined up 
to t max . We found that the effects were not statistically 
significant until the mass was of order 150 eV or so; at 
such a large mass the statistical significance of the change 
in (t) cannot be missed. 

C. Comparison of techniques 

The analysis techniques presented above are appropri- 
ate for the case in which the mass is either small or zero. 
In this case, the signal S(t) and the reference R{t) are not 
easily distinguished for finite statistics. Both the x 2 and 
(t) analyses were optimized for this case by choosing a 
moderate t max = 9 s, which also allowed us to neglect the 
time-independent background. In each case, sensitivity 
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<t> s -<t> R (s) 
FIG. 3. The results of the (£) analysis for a massive v T . In 
the upper panel, the relative frequencies of various (t}s — (£) r 
values are shown for a few example masses. In the lower 
panel, the range of masses corresponding to a given (t)s — (t)« 
is shown. The solid line is the 50% confidence level, and the 
upper and lower dashed lines are the 10% and 90% confidence 
levels, respectively. In this figure, t max = 9 s and the time 
constant of the exponential luminosity is r = 3 s. 

to a v T mass of about 50 eV was found. This is essen- 
tially the mass which cannot be missed even if there are 
unfavorable statistical fluctuations. Since the mass ef- 
fects grow quadratically, for larger masses the statistical 
significance of the mass effects would be huge. 

At a given mass, the ranges of \ 2 or (t) s — (t) R values 
shown in the figures are the ranges of probable values 
that would be seen in one experiment (i.e., one super- 
nova). Those ranges are the result of properly taking 
into account the expected statistical fluctuations of the 
Reference and Signal (while the Signal error dominates, 
the Reference error was included in the calculations). For 
a given experiment, the values of x 2 and (t)s — (t) R can 
be computed from the data. The statistical errors on 
those quantities can also be estimated from the data, 
and should be similar to what is shown in the figures. 

The results from both analysis techniques are essen- 
tially similar. That is, the final results are not strongly 
dependent on the statistical technique used, which is cru- 
cial. Of course, the results from the (t) analysis arc 
slightly better, for the reasons explained above. The fi- 
nal figures for the (t) analysis also allow other confidence 
levels to be constructed easily. Our \ 2 t est was designed 
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TABLE II. The results of the \ 2 analysis for different cases. The decay constant of the exponential luminosity is denoted by 
r. If the masses are zero, the most probable xVd.o.f. = 1. For this xVd.o.f., the allowed mass ranges are given in the second 
column; the lower limit of zero is the most probable mass, and the upper limit is excluded at the 90% confidence level. The 
smallest value of xVd.o.f. not compatible with m = is xVd.o.f. = 1.7. The corresponding allowed mass ranges are given in 
the third column; both the upper and lower limits are excluded at the 90% confidence level. The most probable mass is given 
in parentheses. 



case 


result for xVd.o.f. = 1 


result for xVd.o.f. = 1.7 


r = 3s; m v ^ = 0, m„ T = m 


< m < 60 eV 


< m < 75 eV (m ~ 55 eV) 


t — 3s; m Vtl = m„ T = m 


< m < 40 eV 


< m < 50 eV (m ~ 40 eV) 


r — Is; m Vll = 0, m„ T = m 


< m < 35 eV 


< m < 45 eV (m ~ 30 eV) 



TABLE III. The results of the (t) analysis for different cases. The decay constant of the exponential luminosity is denoted 
by r. If the masses are zero, the most probable {t)s — (£)i? = 0. For this {t)s — (t)R, the allowed mass ranges are given in the 
second column; the lower limit of zero is the most probable mass, and the upper limit is excluded at the 90% confidence level. 
The smallest value of (t)s — (£)ij not compatible with m — is {t)s — = 0.09 s. The corresponding allowed mass ranges 
are given in the third column; both the upper and lower limits are excluded at the 90% confidence level. The most probable 
mass is given in parentheses. For the third case, because of the reduced width of the pulse, 0.03 s is used instead of 0.09 s. 



case 


result for (t) s - (t) R = 


result for (t) s - (t) R = 0.09 s 


r = 3s; m„ w = 0, m„ T = m 


< m < 45 eV 


< m < 70 eV (m ~ 45 eV) 


t = 3s; m Vfi = m„ T = m 


< m < 35 eV 


< m < 45 eV (m ~ 35 eV) 


t — Is; m Vtt = 0, m„ T = m 


< m < 25 eV 


< m < 40 eV (m ~ 25 eV) 



to ask if there was evidence for a nonzero mass, the ev- 
idence being a large x 2 - Strictly speaking, if there was 
such evidence the mass would not be determined with 
that test; one would reformulate the Reference to in- 
clude a mass and would define a new x 2 , which would be 
minimized with respect to the mass. Nevertheless, our 
formulation works reasonably for small masses. Finally, 
because of its greater convenience in use and interpreta- 
tion, as well as its greater sensitivity, we advocate the (t) 
technique. 

We also considered the case in which both the and 
the v T are massive. For convenience, we took m„ M = m„ T ■ 
(Since the time delay is quadratic in the mass, there is 
little difference from the one-mass case unless the masses 
are similar.) The results of the x 2 analysis are shown in 
Fig. 4, and the results of the (t) analysis are shown in 
Fig. 5. As expected, with a better proportion of massive 
events in the Signal, lower masses can be probed. All of 
the results are summarized in Tables II and III. 



D. Comparison to previous work 

Various techniques for determining or limiting the 
and v T masses from observations of supernova neutrinos 
have been proposed. Any such technique must be based 
on a neutral-current signal and by necessity will contain 
events from other reactions with similar signatures but 
caused by v e or v e . Also, in neutral current events, one 
cannot determine the initial neutrino energy on the event 
by event basis. (In the neutrino-electron scattering that 
is possible in principle, but not in practice). Hence, one 



cannot directly determine the energy spectra of the in- 
coming v x neutrinos. 

The most developed technique uses the signal from 
neutrino-electron scattering in SK. All flavors participate 
in this reaction, which has no threshold. Even though the 
Vfj, and v T energies are higher, their thermally- averaged 
cross sections are smaller than for v e and v e (which also 
have a charged-current channel). Thus massless events 
are necessarily part of the irreducible background. There 
are also isotropic background events from the copious 
Ve. + P — > e + + n reaction; by considering only events in 
the forward cone of half-angle about 25 degrees (deter- 
mined by the angular resolution of the Cerenkov detec- 
tor), one can eliminate about 95% of the isotropic back- 
ground Ji|]{L2|. From Table I it follows that if just the v T 
is massive, in the forward cone there are about 700 m = 
events and about 60 m > events. The test for a mass 
is to check whether the events in the forward cone fall off 
more slowly in time than those outside the cone. Since 
the number of massive events is small, one has to look 
for a large delay. At such late times in the tails of the 
scattering rates, the time-independent background rate 
is not at all negligible. 

The most detailed analysis of the neutrino-electron 
scattering case was given in Ref. Q ■ The statistical test 
for a mass was done by a complicated likelihood match- 
ing scheme, and sensitivity to a mass of about 50 eV was 
found. Another detailed analysis was given in Ref. p2[ . 
The statistical test for a mass was simple, and was based 
on looking for an excess of events at late times, where 
an excess was defined as three times the Poisson error. 
In this case, sensitivity to a mass of only about 150 eV 
was found. The question arises if this poorer limit was 
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FIG. 4. The results of the x 2 analysis for a massive and 
v T , taken to have the same mass. The figure is otherwise the 
same as Fig. 2. 
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FIG. 5. The results of the (t) analysis for a massive and 
v T , taken to have the same mass. The figure is otherwise the 
same as Fig. 3. 



caused by the less sophisticated statistical technique. In- 
terestingly, it is not. It is pointed out in Ref Q that 
the authors of Ref. |L0| use a luminosity which decays 
roughly exponentially with a time constant of r = 1 s 
(in contrast to the time constant of r = 3 s used in this 
work). Such a sharp time distribution makes distinguish- 
ing the effects of a mass much easier. In Ref. p2| , it is 
shown that using such a quickly-decaying luminosity and 
the same simple statistical technique that sensitivity to 
50 eV can also be obtained. 

For comparison, we set the exponential time constant 
in the luminosity to r = 1 s and repeated our analysis. 
For t max = 3 s and a bin size of 0.5 s, we found sensi- 
tivity to about 25 eV in the one-mass case. The results 
are also presented in Tables II and III. The advantages 
of the method discussed here, as demonstrated by this 
comparison, are the larger number of events with mass 
and the lower proportion of massless events with a similar 
signature. 

IV. INTERMEDIATE-MASS AND HIGH-MASS 
CASES 

For a low or zero mass (m„ T < 150 eV), the effects on 
the signal S{t) are minimal, and the time-independent 
background is negligible. For an intermediate mass 
(150 eV < m„ T < 1 keV), the effects on S(t) are sub- 



stantial, and the massive component of S(t) will be well- 
separated from the the massless component. The time- 
independent background does not have a large effect, but 
would have to be taken into account. For a large mass 
{ m u T ^ 1 keV), the massive component of S(t) is so de- 
layed and dispersed that its rate is comparable to or be- 
low the time-independent background rate. Given the ac- 
tual data, one can immediately determine which of these 
cases applies. There are analysis techniques that are op- 
timal for each case. It is "fair" to determine the choice 
of technique from the crude characteristics of the data. 

The intermediate-mass case would be rather easy to 
handle. The value of t max would have to be increased 
and the time-independent background rate included. The 
X 2 analysis above was designed to test whether or not a 
mass was necessary to explain the data. For a small 
mass, it can be used to determine that mass. As noted 
earlier, for a large and obvious mass, it would be better 
to revise the x 2 analysis so that the Reference R(t) was 
that appropriate for a given mass. Then the \ 2 could be 
minimized to find the unknown mass and its error. The 
(t) technique requires only the changes noted. For such 
a large mass, the dispersion (broadening) also becomes a 
useful measure of the mass. Almost any technique would 
work in this case since the signal would be so obvious. 

The large-mass case, like the low-mass case, is again 
a marginal analysis since we are by definition looking at 
the limit of detectability. For a large mass, the delays are 
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large compared to the width of the pulse at the source, 
and the integral in Eq. ( |To| ) can be evaluated by assuming 
that the time distribution of the initial pulse is a delta 
function. The scattering rate (per s) is 



dN a , 
dt 



C 
2t 



Ef(E) 



a(E) 



(17) 



and E in MeV is defined 
m^O.hlhD/t (see Eq. (§)), with m in eV, D 



where C is defined in Eq. (|ll] 
as E 

in 10 kpc, and t in s. Note also that / is in MeV -1 . 
The time t is measured from the arrival of the D e events. 
For m = 1 keV, the signal is still several times the time- 
indcpcndcnt background for hundreds of seconds. As the 
mass increases, the height of the signal rate falls very 
quickly. 

Even if S(t) < B at all times, where B is the time- 
independent background rate, it is still possible to deter- 
mine a mass by looking for an excess of counts in some 
long time interval. We assume that the expected num- 
ber of signal events is present (the Poisson fluctuation 
of the signal number will turn out to be a small effect). 
This analysis is therefore more model-dependent than the 
low-mass case, since the number of events enters directly, 
rather than only through the fluctuations. While t is de- 
fined by the arrival of the v e events, they are obviously 
not included in the counts for this analysis. Only a finite 
range of neutrino energies contribute significantly, and 
the largest energy is of order 5 times the smallest. The 
largest delay will thus be of order 25 times the smallest. 
In this case, the simplest and most model-independent 
thing to do is to begin the counting at t = 0. 

We assume that the background rate B is well-known. 
The end of the counting interval t ma x is to be determined. 
The requirement of a statistically significant excess of 
counts is Nb + Ns > Ns+n^Ns, where n is the number 
of sigmas (the number of counts is large enough to treat 
the Poisson distribution as a Gaussian.) Any large excess 
in the number of events will be wholly attributed to the 
signal events, of which there are Ns expected. Using 
Nb = Bt max , this can be rewritten as t max < Ng/n 2 B. 
Note that this is independent of mass. The requirements 
for t m a,x are: 



signal width < i„ 



< 



Aj 
n^B 



(18) 



If the interval is not as wide as the signal, signal events 
will be lost. If it is wider than the signal, too many 
background events will be included. The largest possible 
mass that can be seen with this technique is the one for 
which the signal width is as wide as the right-hand side 
of the equation above. This is 



N s 



i\/Q.515DB 



(19) 



MeV; below that energy, the product f(E)a{E) is es- 
sentially zero. In order to reduce the time-independent 
background rate, we use only the inner 22.5 kton vol- 
ume for this large-mass test, which reduces the number 
of v T signal events to Ns — 250. For this volume, the 
background rate has been measured (ItJ to be of order 
0.1 s _1 . At the three-sigma level (n — 3), the maximum 
detectable mass is then about m max = 9 keV. For this 
mass, even the peak of the signal rate is a factor several 
below the time-independent background rate. Also for 
this mass, t max is of order 10 5 s, so the Poisson error 
on the number of background events is at the 1% level. 
We have assumed that the error on the background rate 
B is not larger than that. The above analysis is opti- 
mized for a flat signal. However, the signal is actually 
peaked at a time smaller than t max , and by increasing 
Bmin to 42 MeV, one still includes about 90% of the sig- 
nal events. While more model-dependent, this increases 
the maximum detectable mass to about m max — 14 keV. 

For comparison, we estimate how large m max would be 
if the signal from neutrino-electron scattering were used. 
Since the signal is forward-peaked, the background can 
be substantially reduced with an angular cut. In this 
case, it makes sense to use the entire 32 kton volume. If 
95% of the background can be removed, and the time- 
independent background rate of 0.1 s _1 used above for 
the inner 22.5 kton can be used for the full volume, then 
B w 0.005 s -1 . Assuming that no signal events are lost 
with this cut, the number of events for E m i n = 5 MeV is 
about Ns = 60. At the three-sigma level, the maximum 
detectable mass is about m max = 2 keV, comparable to 
the estimate in Ref. |n]] . 

If the v r events appear to be missing, a large-mass 
search as above can be made. If nothing is found, there 
are three possibilities. The first possibility is that the 
mass is greater than 10 keV or so, and that it is sta- 
ble over the time it takes to travel from the supernova, 
about 3 x 10 4 years. Then its signal is so dispersed that 
it cannot distinguished against the background. How- 
ever, as pointed out in Ref. |fjj, any neutrino with a mass 
greater than 10 keV or so would likely decay in such a 
time (this avoids violation of the cosmological bound on 
the neutrino masses, see Ref. Jl and references therein). 
The second possibility is that the mass was large enough 
that the neutrinos decayed, and that their decay products 
were not detected. The third possibility is that the v T 
neutrino was not produced in the supernova, or at least 
significantly differently than expected. For example, if 
the v T temperature were much lower than 8 MeV, there 
would be essentially no v T events detected. These three 
possibilities cannot be distinguished without additional 
evidence. 



where m is in eV, E m i n is in MeV, D is in 10 kpc, and B is 
in s~ l . For the v x excitation of 16 0, we take E m i n = 25 
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V. CONCLUSIONS AND DISCUSSION 

One of the key points of our technique is that the abun- 
dant v e events can be used to calibrate the neutrino lu- 
minosity of the supernova and to define a clock by which 
to measure the delay of the v x neutrinos. The internal 
calibration very substantially reduces the model depen- 
dence of our results. The measurement of time relative 
to the v e signal allows us to be sensitive to rather low 
masses. Without such a clock, one cannot determine a 
mass limit with the (t) technique advocated here, since 
the absolute delay is unknown. Instead, one would have 
to constrain the mass from the observed dispersion of the 
events. Our calculations indicate that while a significant 
delay can be seen for m = 50 eV, the dispersion does not 
become significant until m = 150 eV or greater. 

We first assumed that one of and v T masses was 
nonzero, and the other negligibly small. For convenience, 
we referred to the heavier one as v T , though it is impossi- 
ble to tell the difference. The results are given in Figs. 2 
and 3. If it were known that the masses were almost 
degenerate, than a stricter limit can be placed. Those 
results are given in Figs. 4 and 5. If nothing more is 
known, the most conservative thing to do is to take the 
one-mass limit for each of v r and . As shown in Table 
III, if no statistically significant difference of the Refer- 
ence and Signal is seen one can put an upper limit of 45 
eV if one assumes that only one mass in nonvanishing, 
and 35 eV if one assumes that both and v T are massive 
(and that the masses are the same). 

Given the large statistics of the v x signal used here, one 
might wonder why the time delay is not larger and the 
mass sensitivity is not lower than we report here. The 
v x average energy is about 25 MeV. For E 25 MeV, 
m = 50 eV and D = 10 kpc, the delay is about 2 s. 
However, from Eq. (|l^), what matters for the event rate 
is the peak of the product f(E)a(E). Since the cross 
section for the 16 O excitation is very steep in energy, the 
peak energy is large, about 60 MeV. For E « 60 MeV, 
m = 50 eV and D = 10 kpc, the delay is about 0.4 
s. In both cases, these delays are for about 1/3 of the 
events in the Signal, so for a large integration time t max 
the difference (t)s — (t)n would be about 1/3 of these 
delays. For moderate t max , as used in the main analysis, 
the shift is slightly smaller (though more significant than 
for a larger t max ). 

These considerations show that the delay is reduced, 
and the statistical significance decreased, by the seem- 
ingly irreducible background of the v e events at low ener- 
gies as well as by the background caused by the massless 
Vp. Besides, since the energy of the outgoing neutrino 
cannot be measured (or even the excitation energy in 
16 0), it is not possible to measure the energy spectrum 
of the v T neutrinos. Thus the v x temperature can only 
be constrained from the total number of events. 

The situation can be contrasted with the v e mass limit 
of about 20 eV from SN1987A established with only a 



handful of events and no independent clock. There, how- 
ever, it was possible to determine the incoming neutrino 
energy on the event by event basis, and to compare the 
neutrino energies versus time to the theoretical expec- 
tation. Moreover, the SN1987A was at about 50 kpc, 
compared to the 10 kpc assumed for the next Galactic 
supernova, and a lower typical energy should be used in 
the delay formula of the detected v e events than for the 
v x neutral current scattering on 16 O. 

Some of the important parameters used here are not 
well known, though were treated as such. However, once 
there is actually a supernova, the model uncertainties 
will be greatly reduced by the v e data. For example, 
the binding energy Eb and the D e temperature will be 
determined. Other questions that can be resolved include 
the time dependence of the temperature, and whether a 
one-parameter thermal spectrum is sufficient to describe 
the energy spectra. Once a supernova is observed, the 
technique presented here can easily be run with the new 
parameters or necessary modifications. Second, for small 
changes in some parameters, the mass sensitivity does 
not change much. Note that this is especially true if 
mass is unrecognizably small, and we are making a limit. 

The results of this paper are valid for either Dirac or 
Majorana neutrinos. We only considered stable neutri- 
nos. The effects of decaying neutrinos on mass limits 
from supernovae are discussed in Ref ||. We also con- 
sidered unmixed neutrinos. Vacuum oscillations among 
v T , v^, and their antiparticles are irrelevant since the 
numbers of neutrinos of each flavor are assumed to be 
equal. Vacuum oscillations between v T and v e or and 
v e and their antiparticles should have an observable ef- 
fect on the v e spectrum. Oscillations to sterile neutrinos 
would also have an effect. The effects of either vacuum or 
matter-enhanced neutrino mixing on the neutrino signals 
are considered in, e.g., Ref. |22[ . 

In conclusion: We have presented a rather general 
method, including a thorough statistical analysis, of ex- 
tracting information about the possible v T and masses 
from the future detection of a Galactic supernova neu- 
trino burst by the SuperKamiokande detector. When 
such an event in fact occurs, the existing mass limits will 
be vastly improved and will approach, or cross over, the 
cosmological bound. 
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